Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 192
Filtrar
1.
J Voice ; 2024 Apr 16.
Artigo em Inglês | MEDLINE | ID: mdl-38631941

RESUMO

OBJECTIVE: This study explored electrophysiological changes in the laryngeal motor neuropathway and determined whether lesions in the laryngeal motor cortex (LMC) and its descending tract contribute to voice deterioration and peripheral nerve palsy in patients with nasopharyngeal carcinoma (NPC) postradiotherapy (RT). STUDY DESIGNS: Prospective cohort study. METHODS: Twenty-two patients with NPC at 2 to 4years post-RT (8 female and 14 male), 22 patients with NPC at 8 to 10years post-RT (8 female and 14 male), and 22 healthy individuals (9 female and 13 male) were selected to test their magnetic evoked potentials (MEP), motor nerve conduction, and voice quality using transcranial magnetic stimulation, laryngeal electromyography, and the XION DiVAS acoustic analysis software. Three groups were matched according to approximate age. Multiple comparisons were performed among the three groups. RESULTS: The voice quality of post-RT patients with NPC deteriorated compared to that of healthy individuals. Bilateral LMC and their corticonuclear tracts to the bilateral ambiguous nuclei of post-RT patients with NPC were impaired according to multigroup comparisons of MEP amplitudes, latencies, and resting motor thresholds. The vagus and recurrent laryngeal nerves (RLN) of post-RT patients with NPC were impaired according to multigroup comparisons of the amplitude and latencies of the compound muscle action potential and latencies of f-waves. CONCLUSIONS: The voice quality of patients with NPC deteriorated after RT. The pathogenesis of post-RT voice deterioration may involve radiation-induced injuries to the vagus, RLN, and bilateral LMC. Furthermore, radiation-induced injuries to the bilateral LMC may contribute to vagus and RLN palsies. These findings support the use of transcranial approaches to treating voice disorders and peripheral nerve palsies in post-RT patients with NPC. TRIAL REGISTRATION: ChiCTR2100054425; Electrophysiological Study of Vocal-Fold Mobility Disorders After Radiotherapy for NPC Patients via Magnetic Evoked Potential and Their Correlation with Voice Quality Assessment; https://www.chictr.org.cn/bin/project/edit?pid=144429.

2.
J Voice ; 2024 Mar 26.
Artigo em Inglês | MEDLINE | ID: mdl-38538409

RESUMO

Vocal tics can occur in neuropsychiatric disorders and result in familial distress. Management is challenging, particularly in children with developmental delay. A 5-year-old with cerebral dysgenesis presented with a high amplitude, high-frequency vocal tic. Type II thyroplasty with bilateral cricothyroid muscle myectomy was performed after initial botulinum toxin trial. Amount, volume, and pitch of tics significantly decreased, without change in swallow. Benefits persisted at 1-year follow-up. This is the first description of combined type II thyroplasty with cricothyroid myectomy for high-frequency vocal tic. This can be done safely and provide sustained benefit for a rare, impactful voice disorder.

3.
J Voice ; 2024 Jan 08.
Artigo em Inglês | MEDLINE | ID: mdl-38195337

RESUMO

OBJECTIVES: Practitioners rely heavily on flexible endoscopic visualization of the true vocal folds during a repeated "sniff-ee" maneuver to assess vocal fold mobility. However, the human eye lacks the temporal and spatial precision required to accurately gauge fine differences in maximal glottal angle. This study compared differences in maximal glottal angle variables during "sniff-ee" maneuvers across patients with various voice and laryngeal breathing disorders. METHODS: We retrospectively measured glottal angle from flexible laryngoscopy examinations in six groups of patients with voice and upper airway disorders: laryngeal dystonia/essential tremor (LD/ET), vocal fold lesions, vocal fold atrophy, paradoxical vocal fold motion disorder (PVFMD), muscle tension dysphonia (MTD), and healthy controls. Maximum glottal angle (GAMAX) and average glottal angle (GAAVG) were calculated during three serial "sniff-ee" maneuvers for all participants. Individual disorder groups (MTD, PVFMD, LD/ET, atrophy, and lesion) and broader disorder types (functional and organic) were compared to healthy controls using simple linear regression analyses. RESULTS: No significant difference in either GAMAX or GAAVG was found between controls and the disorder subgroups or broader disorder type (function and organic). However, there were statistically significant differences in the variability of GAMAX in both PVFMD (6.2° more variability; P < 0.001) and LD/ET (5.8° more variability; P < 0.001) compared to healthy controls. CONCLUSION: Patients diagnosed with LD/ET and PVFMD both demonstrated significantly more variability in their GAMAX compared to healthy controls, suggesting that movement consistency or coordination may be relatively compromised in these patient groups. Further research is warranted to investigate the sensitivity and specificity of glottal angle variability in diagnosing PVFMD and LD in clinical or research settings. LEVEL OF EVIDENCE: 4 SHORT SUMMARY: Laryngeal examinations from five patient groups were compared to those from healthy controls. Patients with paradoxical vocal fold motion disorder and laryngeal movement disorders exhibited significantly greater variability of glottal angle during sniff maneuver compared to healthy controls.

4.
J Laryngol Otol ; 138(2): 224-231, 2024 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-37334556

RESUMO

OBJECTIVE: The main purpose of this study was to retrospectively evaluate the efficiency of DoctorVox voice therapy in psychogenic dysphonia or aphonia patients, and to share the mid- to long-term results of the method. METHODS: The study was carried out on patients who underwent DoctorVox voice therapy for psychogenic dysphonia or aphonia between January 2015 and September 2019. The evaluation methods used were: the Voice Handicap Index-10; the grade, roughness, breathiness, asthenia and strain ('GRBAS') scale; and videolaryngostroboscopy recordings. RESULTS: The mean Voice Handicap Index-10 values of the patients were 30.91 ± 2.97 before treatment, 8.14 ± 3.82 after treatment, and 3.36 ± 1.78 in the final follow-up examination. The grade, roughness, breathiness, asthenia and strain scale scores were: 9 ± 0.67 pre-treatment, 0.78 ± 0.80 post-treatment, and 0.57 ± 0.64 at the final follow up. CONCLUSION: DoctorVox voice therapy seems to be an efficient treatment method for psychogenic dysphonia or aphonia; it helps develop phonatory muscle functions, using multidimensional biofeedback mechanisms, and increases the patients' therapy adherence.


Assuntos
Disfonia , Humanos , Disfonia/diagnóstico , Afonia , Estudos Retrospectivos , Astenia , Qualidade da Voz
5.
Am J Otolaryngol ; 45(2): 104139, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38101138

RESUMO

INTRODUCTION: Mutational voice disorder is the inability of the voice to adjust to the changes in the larynx during puberty, resulting in the speaking fundamental frequency failing to decrease. Standard treatments for mutational voice disorder are voice therapy and thyroplasty. However, voice therapy takes time to show its effects, and thyroplasty is highly invasive. Herein, we present a case of mutational voice disorder successfully treated with intracordal trafermin injection. CASE SUMMARY: A 31-year-old male patient was diagnosed with mutational voice disorder and offered standard treatment, but he requested a less invasive treatment with early effects. We performed intracordal trafermin injection with his consent. Two months after the procedure, the speaking fundamental frequency decreased from 155.5 Hz to 93.0 Hz, and the voice handicap index decreased from 14 to 2. DISCUSSION: This case suggests that intracordal trafermin injection is an effective treatment option for mutational voice disorder. Furthermore, compared with the standard treatment methods, it is less invasive and provides effects shortly with only one injection.


Assuntos
Fatores de Crescimento de Fibroblastos , Fragmentos de Peptídeos , Distúrbios da Voz , Voz , Masculino , Humanos , Adulto , Distúrbios da Voz/tratamento farmacológico , Distúrbios da Voz/cirurgia , Resultado do Tratamento , Injeções
6.
J. appl. oral sci ; 32: e20230296, 2024. tab, graf
Artigo em Inglês | LILACS-Express | LILACS | ID: biblio-1550475

RESUMO

Abstract Research would be important for obtaining a better understanding of voice complaints among patients with temporomandibular disorders (TMD). Objective The identification of predictors of voice disorders associated with TMD pain was made according to Diagnostic Criteria for TMD (DC/TMD) Axis I. Methodology Functional limitations were measured using the Jaw Functional Limitation Scales for mastication (JFLS-M), jaw mobility (JFLS-JM), and verbal and emotional expression (JFLS-VEE). Patients also completed the Hospital Anxiety and Depression Scale (HADS). The primary outcome was social-emotional and physical functioning as indicated by scores on the Voice-Related Quality of Life (V-RQOL) questionnaire. Multiple linear regression was used to model the relationship between the domains on the V-RQOL questionnaire and scores on the HADS and JFLS after adjusting for age, gender, DC/TMD diagnosis, pain intensity, and time since pain onset. Results The HADS-D (B=-1.15; 95% CI, -1.718 to -0.587; p<.001) and JFLS-VEE (B=-0.22; 95% CI, -0.40 to -0.06; p=.008) were significant predictors of scores on the V-RQOL questionnaire. Conclusion Higher scores on depression measures and limitations in verbal and emotional expression could exacerbate voice problems among TMD pain patients. Future research should promote multidisciplinary treatments for TMD pain-related voice disorders.

7.
J Voice ; 2023 Dec 02.
Artigo em Inglês | MEDLINE | ID: mdl-38044169

RESUMO

OBJECTIVES: The purpose of the present study was to examine the effect of adding a cricothyroid visor maneuver to conventional voice-facilitating techniques on voice quality and reported symptoms in patients with primary muscle tension dysphonia. METHODS: This was a double-blind two parallel-group clinical trial in which 20 adult patients participated through convenience sampling. Participants were allocated to intervention (combined treatment) and control (conventional treatment) groups. The intervention was performed for both groups for five sessions, twice a week. The two groups were compared after the intervention for primary outcome measures including maximum phonation time, jitter, shimmer, harmonic-to-noise ratio, and consensus auditory-perceptual evaluation of voice, and for secondary outcome measures including the voice handicap index, the voice activity and participation profile, the voice-related pain scale, and the vocal tract discomfort scale. RESULTS: Within-group primary outcome comparison showed that both groups showed significant improvement in maximum phonation time and consensus auditory-perceptual evaluation of voice indices after treatment. the between-group comparison showed that the maximum phonation time increased significantly in the intervention group after the treatment (P = 0.03) and the effect size was large (es = 1.05). within-group secondary outcome comparison showed that all indices improved significantly in both groups except for the voice-related pain scale frequency. The between-group comparison showed that except for voice-related pain scale frequency, the intervention group reported significantly more improvement in all other self-reporting indices with a large effect size. CONCLUSIONS: The study showed that adding cricothyroid visor maneuver to conventional voice-facilitating techniques, compared to conventional treatment alone, resulted in a significant increase in maximum phonation time, reduction in pain and vocal tract discomfort, increase in activity and participation, and improvement in voice-handicapped index in primary muscle tension dysphonia patients. Therefore adding cricothyroid visor maneuver to other treatments can be an effective method in improving primary muscle tension dysphonia which needs more studies in the future.

8.
J Voice ; 2023 Dec 07.
Artigo em Inglês | MEDLINE | ID: mdl-38065808

RESUMO

PURPOSE: The purpose of this study was to understand the role of implicit racial bias in auditory-perceptual evaluations of dysphonic voices by determining if a biasing effect exists for novice listeners in their auditory-perceptual ratings of Black and White speakers. METHOD: Thirty speech-language pathology graduate students at Boston University listened to audio files of 20 Black speakers and 20 White speakers of General American English with voice disorders. Listeners rated the overall severity of dysphonia of each voice heard using a 100-unit visual analog scale and completed the Harvard Implicit Association Test (IAT) to measure their implicit racial bias. RESULTS: Both Black and White speakers were rated as less severely dysphonic when their race was labeled as Black. No significant relationship was found between Harvard IAT scores and differences in severity ratings by race labeling condition. CONCLUSIONS: These findings suggest a minimizing bias in the evaluation of dysphonia for Black patients with voice disorders. These results contribute to the understanding of how a patient's race may impact their visit with a clinician. Further research is needed to determine the most effective interventions for implicit bias retraining and the additional ways that implicit racial bias impacts comprehensive voice evaluations.

9.
J Voice ; 2023 Dec 22.
Artigo em Inglês | MEDLINE | ID: mdl-38142188

RESUMO

OBJECTIVES: The aim of the study was to investigate whether the vocal self-concept and potential vocal and related mental health problems of university teachers and academic advisers can be improved by participating in preventive voice training. STUDY DESIGN AND METHODS: University teachers and academic advisers show an increased susceptibility to occupational risk factors related to their voice and an increased prevalence of developing a voice disorder in the course of their employment. An experimental, prospective, longitudinal study was conducted to examine whether voice training (1) improves physiological vocal function, vocal performance, vocal self-concept, and mental health of university teachers and academic advisers; (2) reduces unfavorable influencing factors at the university workplace; and (3) fewer voice problems are reported from the perspective of university teachers and academic advisers. RESULTS: This study showed that voice training has positive influences on the voice (function and quality), voice self-concept, and well-being of university staff. CONCLUSIONS: Workplace prevention programs can help to reduce the high prevalence of voice disorders among university teachers and advisers and counteract the risk factors. They should therefore be firmly integrated into continuing education/university health management and everyday life to cope with certain work-related vocal stresses and to maintain psychological and vocal well-being throughout the university career. Gender and age-related aspects should be considered.

10.
J Voice ; 2023 Nov 11.
Artigo em Inglês | MEDLINE | ID: mdl-37957070

RESUMO

OBJECTIVES/HYPOTHESIS: The Glottal Function Index (GFI) is a four-item self-administered survey suitable for the evaluation and treatment of patients with glottal dysfunction. To date, it has been translated into Lithuanian, Persian, and Hebrew. This study is intended to translate and cross-culturally adapt the GFI for use in Arabic-speaking patients with dysphonia. STUDY DESIGN: This work is a cross-sectional study involving the administration of the GFI to participants with dysphonia (cases) and patients without dysphonia (controls). The validation process included reliability and validity assessments. METHODS: The GFI was translated using forward and backward translation methods from English into Arabic. The questionnaire's reliability was assessed using Cronbach's alpha and test-retest reliability (intraclass correlation coefficient, ICC). The Mann-Whitney test evaluated validity by comparing cases and controls. Finally, the Kruskal-Wallis test examined differences in the GFI across various pathologies. RESULTS: The GFI demonstrated favorable internal consistency (Cronbach's alpha = 0.848) and excellent test-retest reliability (ICC = 0.993). Significant differences in the A-GFI score between the cases and controls were also observed (P < 0.001), supporting the instrument's validity. However, no statistically significant differences were found in A-GFI across different diagnoses (P = 0.712). CONCLUSIONS: The A-GFI is a valid and reliable screening tool for clinicians to assess dysphonia and voice impairment in patients in Arabic-speaking countries. The tool is easy to administer in daily clinical practice given its brevity and self-administration.

11.
Laryngoscope ; 2023 Oct 06.
Artigo em Inglês | MEDLINE | ID: mdl-37800700

RESUMO

OBJECTIVE: Laryngoplasty requires the manipulation of the vocal folds, which are not visible during the operation. The widespread use of this technique is limited by the need for adequate knowledge of anatomy, the small surgical field, and the high level of skill required for the procedure. An exoscope has been developed to provide a stereoscopic view similar to that of a microscope while using the same compact endoscopic tool. This study aimed to determine whether the three-dimensional (3D) exoscopic surgical technique could be applied to laryngoplasty and explore its possibility to ultimately replace the current approach. METHODS: This was a retrospective case series analysis, which included 28 patients with hoarseness who underwent surgery with (Exoscope; n = 12) or without (Macrosurgery; n = 16) a 3D exoscope between July 2018 and February 2021. The feasibility of performing all surgical steps with the 3D exoscope was evaluated. The Exoscope and Macrosurgery groups were compared for surgical time, vocal function outcomes, and complications. Questionnaires were completed by medical staff regarding the usefulness of medical education. RESULTS: No intraoperative or postoperative complications occurred in either procedure. The operative time was similar in both groups. The vocal function outcomes were also comparable between the groups. Questionnaires revealed that the exoscope was useful in terms of sharing information on surgical procedures and anatomy, as well as functioning as an educational tool. CONCLUSION: While this was a preliminary study, our results indicated that the exclusive use of the 3D exoscope was feasible for open approaches. LEVEL OF EVIDENCE: Level IV Laryngoscope, 2023.

12.
J Voice ; 2023 Oct 20.
Artigo em Inglês | MEDLINE | ID: mdl-37867071

RESUMO

OBJECTIVES: The purpose of this paper is to review seminal identity theories grounded in social psychology and one concept from voice science and explain how this group may point to identity factors facilitating or impeding voice habilitation and rehabilitation. METHODS: Identity theories from the social psychology literature (Dramaturgical Theory, Self-Categorization Theory, Self-Determination Theory, Identity Negotiation Theory) and vocal congruence are described. Concepts are synthesized with voice science research to explore potential identity-behavior relations at play in voice habilitation and rehabilitation. RESULTS: Applicable concepts from social psychology and voice science suggest identity-related processes by which a client may or may not develop a voice difference/disorder, seek intervention, and achieve goals in intervention. A bidirectional relationship between identity and behavior has been well-established in the social psychology literature. However, the relevance of vocal behavior has yet to be formally examined within this literature. Importantly, although connections between behavioral tendencies and voice disorders as well as the contribution of identity to gender-affirming voice treatment have been established in the voice science literature, the consideration of identity's possible role in voice habilitation and rehabilitation in cis gender individuals has thus far been scant. CONCLUSIONS: Research into identity and voice habilitation and rehabilitation may help to improve voice intervention outcomes. A possible adjunct to human studies is agent-based modeling or other computational approaches to assess the myriad factors that may be relevant within this line of inquiry.

13.
J Voice ; 2023 Oct 25.
Artigo em Inglês | MEDLINE | ID: mdl-37891129

RESUMO

The incidence rate of voice diseases is increasing year by year. The use of software for remote diagnosis is a technical development trend and has important practical value. Among voice diseases, common diseases that cause hoarseness include spasmodic dysphonia, vocal cord paralysis, vocal nodule, and vocal cord polyp. This paper presents a voice disease detection method that can be applied in a wide range of clinical. We cooperated with Xiangya Hospital of Central South University to collect voice samples from 352 different patients. The Mel Frequency Cepstrum Coefficient (MFCC) parameters are extracted as input features to describe the voice in the form of data. An innovative model combining MFCC parameters and single convolution layer CNN is proposed for fast calculation and classification. The highest accuracy we achieved was 92%, it is fully ahead of the original research results and internationally advanced. And we use advanced voice function assessment databases (AVFAD) to evaluate the generalization ability of the method we proposed, which achieved an accuracy rate of 98%. Experiments on clinical and standard datasets show that for the pathological detection of voice diseases, our method has greatly improved in accuracy and computational efficiency.

14.
J Voice ; 2023 Sep 29.
Artigo em Inglês | MEDLINE | ID: mdl-37778959

RESUMO

OBJECTIVE: To analyze the internal consistency of the Voice Handicap Index (VHI) for evaluating the vocal handicap of individuals with dysphonia. METHODS: This is a systematic review. Studies with a cross-sectional design and including a population of individuals with dysphonia, which validated the VHI and analyzed its internal consistency, were included. The following electronic databases were searched: Cochrane Library, EMBASE, LILACS, and PubMed, including Medline, Scopus, and Web of Science. A manual search was performed in gray literature through the Biblioteca Digital Brasileira de Teses e Dissertações and ProQuest Dissertation & Theses. In addition, the list of references of studies selected in the electronic search was mapped, and an expert in the area was consulted. Two reviewers blindly and independently conducted the selection, data extraction, and analysis of the risk of bias, the certainty of the evidence, and good psychometric measures. A meta-analysis was performed with a random effects model using the JAMOVI 2.3.2 software. RESULTS: Forty-nine studies were analyzed. In risk of bias assessment, the studies were classified as having inadequate structural validity and very good internal consistency. The analysis of good psychometric properties indicated indeterminate structural validity and insufficient internal consistency. The overall value of Cronbach's alpha was estimated at 0.94, thus suggesting a very good internal consistency. However, there was high heterogeneity. The level of certainty of the evidence was too low for internal consistency. CONCLUSION: The VHI proved to be a consistent and reliable patient-reported outcome measure to evaluate voice handicap in individuals with dysphonia; however, studies are heterogeneous, and the certainty of evidence is very low.

15.
Neurol Clin ; 41(4): 635-646, 2023 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-37775195

RESUMO

Historically, formal training for speech-language therapists (SLTs) in the area of functional speech and voice disorders (FSVD) has been limited, as has the body of empirical research in this content area. Recent efforts in the field have codified expert opinions on best practices for diagnosing and treating FSVD and have begun to demonstrate positive treatment outcomes. To provide comprehensive interventions for these complex conditions at the intersection of neurology, psychiatry, and other medical specialties, the SLT must not only build knowledge of diagnostic strategies and components of symptomatic treatment in FSVD but also embrace behavior change techniques and counseling strategies.


Assuntos
Fala , Distúrbios da Voz , Humanos , Distúrbios da Voz/diagnóstico , Distúrbios da Voz/terapia , Resultado do Tratamento
16.
J Voice ; 37(5): 648-662, 2023 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-37717981

RESUMO

OBJECTIVE: Speech signal processing has become an important technique to ensure that the voice interaction system communicates accurately with the user by improving the clarity or intelligibility of speech signals. However, most existing works only focus on whether to process the voice of average human but ignore the communication needs of individuals suffering from voice disorder, including voice-related professionals, older people, and smokers. To solve this demand, it is essential to design a non-invasive repair system that processes pathological voices. METHODS: In this paper, we propose a repair system for multiple polyp vowels, such as /a/, /i/ and /u/. We utilize a non-linear model based on amplitude-modulation (AM) and a frequency-modulation (FM) structure to extract the pitch and formant of pathological voice. To solve the fracture and instability of pitch, we provide a pitch extraction algorithm, which ensures that pitch's stability and avoids the errors of double pitch caused by the instability of low-frequency signal. Furthermore, we design a formant reconstruction mechanism, which can effectively determine the frequency and bandwidth to accomplish formant repair. RESULTS: Finally, spectrum observation and objective indicators show that the system has better performance in improving the intelligibility of pathological speech.


Assuntos
Distúrbios da Voz , Voz , Humanos , Idoso , Fala , Distúrbios da Voz/diagnóstico , Algoritmos , Cognição
17.
Turk Arch Otorhinolaryngol ; 61(2): 66-74, 2023 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-37727815

RESUMO

Objective: Vocal fold nodules (VFNs) are among the most common causes of dysphonia. Phono-laryngeal microsurgery, pharmacological treatments, and voice therapy (VT) have been used for treating VFNs. VT has been advocated as the primary treatment of choice. This study investigated the efficacy of the DoctorVox Voice therapy technique (DVT) for treating VFNs. Methods: A total of 38 patients with VFNs and 40 individuals without any voice problem (control group) were included. All patients received the DVT program. Otorhinolaryngology examination, videolaryngostroboscopy (VLS), and acoustic analysis (SPL, mean F0, jitter %, shimmer %, NHR) were performed at pretreatment, one and six months after the end of treatment. The voice handicap index-10 (VHI-10) and the GRB scales were used for perceptual voice evaluation. GRB and VLS scorings were done blindly. Results: Compared with the pretreatment values, the first- and the sixth-month values after treatment demonstrated a significant decrease in VHI-10 (19.5 vs. 5.1), GRB (2.3 vs 0.68 for G value) and VLS scores, SPL (54.4 vs 66.1 dB), F0 (201 vs. 227 Hz), jitter % (1.46 vs 0.85), shimmer % (3.27 vs 2.51), NHR (1.15 vs. 0.46) values among patients. Most of the voice parameters in the sixth month after the DVT program did not differ significantly from those of the control group. Conclusion: The DVT was found to be an effective method in VFN treatment.

18.
Diagnostics (Basel) ; 13(18)2023 Sep 06.
Artigo em Inglês | MEDLINE | ID: mdl-37761237

RESUMO

Vocal arts medicine provides care and prevention strategies for professional voice disorders in performing artists. The issue of correct "Fach" determination depending on the presence of a lyric or dramatic voice structure is of crucial importance for opera singers, as chronic overuse often leads to vocal fold damage. To avoid phonomicrosurgery or prevent a premature career end, our aim is to offer singers an improved, objective fach counseling using digital sound analyses and machine learning procedures. For this purpose, a large database of 2004 sound samples from professional opera singers was compiled. Building on this dataset, we employed a classic ensemble learning method, namely the Random Forest algorithm, to construct an efficient fach classifier. This model was trained to learn from features embedded within the sound samples, subsequently enabling voice classification as either lyric or dramatic. As a result, the developed system can decide with an accuracy of about 80% in most examined voice types whether a sound sample has a lyric or dramatic character. To advance diagnostic tools and health in vocal arts medicine and singing voice pedagogy, further machine learning methods will be applied to find the best and most efficient classification method based on artificial intelligence approaches.

19.
J Voice ; 2023 Jun 09.
Artigo em Inglês | MEDLINE | ID: mdl-37302909

RESUMO

OBJECTIVES: This paper reports the effectiveness of formant-aware spectral parameters to predict the perceptual breathiness rating. A breathy voice has a steeper spectral slope and higher turbulent noise than a normal voice. Measuring spectral parameters of acoustic signals over lower formant regions is a known approach to capture the properties related to breathiness. This study examines this approach by testing the contemporary spectral parameters and algorithms within the framework, alternate frequency band designs, and vowel effects. METHODS: Sustained vowel recordings (/a/, /i/, and /u/) of speakers with voice disorders in the German Saarbrueken Voice Database were considered (n: 367). Recordings with signal irregularities, such as subharmonics or with roughness perception, were excluded from the study. Four speech language pathologists perceptually rated the recordings for breathiness on a 100-point scale, and their averages were used in the analysis. The acoustic spectra were segmented into four frequency bands according to the vowel formant structures. Five spectral parameters (intraband harmonics-to-noise ratio, HNR; interband harmonics ratio, HHR; interband noise ratio, NNR; and interband glottal-to-noise energy, GNE, ratio) were evaluated in each band to predict the perceptual breathiness rating. Four HNR algorithms were tested. RESULTS: Multiple linear regression models of spectral parameters, led by the HNRs, were shown to explain up to 85% of the variance in perceptual breathiness ratings. This performance exceeded that of the acoustic breathiness index (82%). Individually, the HNR over the first two formants best explained the variances in the breathiness (78%), exceeding the smoothed cepstrum peak prominence (74%). The performance of HNR was highly algorithm dependent (10% spread). Some vowel effects were observed in the perceptual rating (higher for /u/), predictability (5% lower for /u/), and model parameter selections. CONCLUSIONS: Strong per-vowel breathiness acoustic models were found by segmenting the spectrum to isolate the portion most affected by breathiness.

20.
Sensors (Basel) ; 23(11)2023 May 30.
Artigo em Inglês | MEDLINE | ID: mdl-37299922

RESUMO

Biometrics-based authentication has become the most well-established form of user recognition in systems that demand a certain level of security. For example, the most commonplace social activities stand out, such as access to the work environment or to one's own bank account. Among all biometrics, voice receives special attention due to factors such as ease of collection, the low cost of reading devices, and the high quantity of literature and software packages available for use. However, these biometrics may have the ability to represent the individual impaired by the phenomenon known as dysphonia, which consists of a change in the sound signal due to some disease that acts on the vocal apparatus. As a consequence, for example, a user with the flu may not be properly authenticated by the recognition system. Therefore, it is important that automatic voice dysphonia detection techniques be developed. In this work, we propose a new framework based on the representation of the voice signal by the multiple projection of cepstral coefficients to promote the detection of dysphonic alterations in the voice through machine learning techniques. Most of the best-known cepstral coefficient extraction techniques in the literature are mapped and analyzed separately and together with measures related to the fundamental frequency of the voice signal, and its representation capacity is evaluated on three classifiers. Finally, the experiments on a subset of the Saarbruecken Voice Database prove the effectiveness of the proposed material in detecting the presence of dysphonia in the voice.


Assuntos
Disfonia , Voz , Humanos , Disfonia/diagnóstico , Acústica da Fala , Qualidade da Voz , Medida da Produção da Fala/métodos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...